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1.  Introduction 


The  failure  rate  function  h(x)  =  f (x)/(l-F(x) )  ,  corresponding 
to  a  distribution  F  with  density  f  ,  is  one  of  the  most  important 
parameters  in  reliability  theory.  A  problem  of  considerable  interest, 
especially  to  inventory  theorists,  logistics  planners,  reliability  en¬ 
gineers,  and  seismologists,  is  the  estimation  of  h  from  a  sample  of 
n  independent  and  identically  distributed  lifetimes  X.,,..,X  . 

Several  nonparametric  methods  for  estimating  h  have  been  proposed  in 
the  literature;  many  of  these  methods  are  based  on  the  assumption  that 
h  is  a  monotonic  function  of  x  .  Whereas  such  an  assumption  may  be 
realistic  in  many  applications,  estimators  of  h  which  are  not  based 
upon  any  assumptions  about  h  ,  appear  to  be  more  palatable  to  a  large 
spectrum  of  users. 

This  paper  is  an  attempt  to  survey  the  various  nonparametric, 
and  non-Bayesian  methods  for  estimating  an  h  on  which  no  monotonicity 
conditions  are  imposed,  and  to  list  the  important  properties  of  these 
estimates.  Bayesian  methods  for  estimating  h  will  be  surveyed  in  a 
sequel  to  this  paper. 


It  is  to  be  emphasized  that  the  literature  on  the  topic  of  this 
paper  is  fairly  extensive  and  scattered,  ranging  from  reliability 
studies  to  mortality  studies  and  seismology.  Furthermore,  it  is  tech¬ 
nically  quite  detailed,  involving  a  host  of  regularity  conditions,  and 
therefore  difficult  to  synthesize.  Consequently,  a  complete  discussion 
of  each  method  of  estimation  is  not  undertaken  here.  It  is  hoped  that 
this  paper  will  serve  as  a  guide,  and  the  reader,  having  read  the  sur¬ 
vey,  may  then  be  motivated  to  read  those  individual  papers  which  cap¬ 
ture  his  or  her  interest. 

The  organization  of  this  survey  is  as  follows.  Section  2  per¬ 
tains  to  some  preliminaries  and  introduces  some  notation  and  terminol¬ 
ogy.  Section  3  briefly  discusses  a  graphical  method  of  estimating  h  , 
and  Section  4  summarizes  estimators  of  h  which  are  commonly  used  by 
biometricians,  with  a  special  emphasis  on  the  "life  table"  and  the 
"product  limit  estimates."  Section  5,  which  occupies  a  large  portion 
of  this  survey,  pertains  to  "kernel  estimates"  of  h  and  methods  of 
improving  these  estimates  using  the  "generalized  jackknife."  Section 
6  discusses  some  recent  results  involving  a  "naive  estimator"  and  its 
smoothing  using  kernels  of  fixed  and  random  band  widths.  In  both  Sec¬ 
tions  5  and  6  some  global  results  for  the  appropriate  estimates  are 
also  given.  In  Section  7,  the  "generalized  failure  rate  function"  is 
introduced  and  methods  of  estimating  it  are  outlined. 


2.  Preliminaries  and  Notation 

Let  X  be  a  nonnegative  random  variable  with  absolutely  continu¬ 
ous  distribution  function  F  ,  and  probability  density  function  f  . 

If  X  denotes  the  lifetime  of  a  physical  device  or  a  biological  or¬ 
ganism,  then  F(x)  is  the  probability  of  the  event  that  the  device  has 
failed  by  age  x  ,  and  R(x)  =  l-F(x)  ,  called  the  reliability  of  the 
device  to  age  x  ,  is  the  probability  that  the  device  survives  to  age 
x  .  The  failure  rate  function  h(x)  at  age  x>0  is  defined  as 


?(*> 

1  -  F(x)  5 


if  F  (x)  +  1 


-  2  - 


* 


h(x) 


(2.1) 
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The  function  h(x)  is  also  known  as  the  "hazard  function,"  the  "mortal¬ 
ity  intensity,"  the  "age-specific  death  rate,"  or  the  "force  of  mortali¬ 
ty,"  depending  upon  the  context  in  which  h(x)  is  used. 

The  failure  rate,  as  a  function  of  time  (age),  has  a  probabilist¬ 
ic  interpretation,  namely,  h(x)dx  represents  the  probability  that  a 
device  which  is  surviving  at  age  x  will  fail  in  the  interval 
(x  ,  x+dx)  .  On  the  basis  of  physical,  biological,  or  engineering  con¬ 
siderations,  one  is  at  liberty  to  choose  the  functional  form  of  h(x) 
for  a  particular  device  or  organism.  Once  this  is  done,  a  differential 
equation  in  h(x)  is  obtained,  from  which  f(x)  and  h(x)  can  easily 
be  determined.  In  fact,  one  can  show  [see  Barlow  and  Proschan  (1975), 
or  Mann,  Schafer,  and  Singpurwalla  (1974)],  that 

h(x)  =  [-&n(l~F(x)) ]  ,  (2.2) 

and 

f(x)  =  h(x)|exp  -  /  h(t)dtj  .  (2.3) 

\ 

The  purpose  of  this  paper  is  to  discuss  the  various  nonparametric 
and  non-Bayesian  methods  that  have  been  proposed  to  estimate  h(x)  ,  and 
to  state  the  properties  of  the  proposed  estimators. 


Let  X...  <.  X,„..  <  ...  <  X,  .  denote  the  ordered  values  of  a 
(1)  -  (2)  -  -  (n) 

random  sample  X  , ...,X  from  a  population  with  absolutely  continuous 
1  n 

distribution  function  F  and  density  f  .  Let  F^(x)  he  t*,e  empirical 

sample  distribution  function  of  X, ,...,X  ;  that  is, 

1  n 

F  (x)  =  —  (number  of  observations  among  X, ,...,X  <  x}  , 

n  n  1  n  = 

and 

R  (x)  =  —  (number  of  observations  among  X, ,...,X  >  x}  . 

n  n  °  1  n 


Throughout  this  paper,  the  following  abbreviations  will  be  used. 

If  (a  )  and  (b  }  are  two  sequences,  then  "a  ~  b  "  is  read 
1  nJ  1  nJ  n  n 

"a  is  asymptotically  equivalent  to  b  ,"  and  means  that  the  ratio  of  a 


-  3  - 
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to  b  has  limit  one.  The  notation  "a  ~  o(b  )"  means  that  the  ratio 
n  n  n 

a  to  b  has  limit  zero,  and  "a  =  Q(b  )"  means  that  the  absolute 
n  n  n  n 

value  of  the  ratio  is  bounded  in  the  limit.  The  terms  o(b  )  and 

n 

OCb^)  are  frequently  used  to  represent  some  unknown  function  of  n 
which  has  the  appropriate  property. 

A  sequence  of  random  variables  {X  (oi) }  defined  on  a  space  £2 

n 

is  said  to  converge  with  probability  one,  or  converge  almost  surely  to  a 

random  variable  X(w )  ,  if 

lim  X  (to)  =  X(io) 
n-*»  n 

for  almost  all  we  £2  ;  that  is, 

Pjw;  lim  X  (w)  =  X(w) }  =  1  . 
n-x» 

This  type  of  convergence  will  be  denoted  by  "X^+X  ,  w.p.l  ." 

The  sequence  of  random  variables  (X  (to)}  is  said  to  converge  in 

P 

probability  to  a  random  variable  X(w)  ,  denoted  by  "Xn  ■*  X"  ,  iff  for 
every  e  >  0 

lim  P<u>  :  jx  (to)-X(to)  |  ,>  e  >  =  0  . 

n-+°°  '  n  * 


Let  X  (to)  be  a  sequence  of  random  variables,  and  let  (b  }  be 
n  n 

any  other  sequence.  Then  the  notation  "X^  =  o^fb^)"  denotes  the  fact 

p 

that  (X  /b  )  0  ,  so  that  X  *  o  (1)  is  another  way  of  writing 

n  n  n  p 

P 

X  -*•  0  .  The  notation  "X  =0  (b  )"  denotes  the  fact  that  for  every 
n  n  p  n 

c  >  0  ,  there  exists  a  K  such  that 

G 


(  x  (Oi)  ) 

r 


>  l-e 


for  all  n  >  ,  where  is  some  integer  which  depends  on  G  . 
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If  the  sequence  of  random  variables  {X^O) }  have  corresponding 
distribution  functions  F^  ,  and  the  random  variable  X(w)  has  distri¬ 
bution  function  F  ,  then  the  sequence  {X^Ccu)}  is  said  to  converge  in 
distribution,  or  converge  in  taw  to  the  random  variable  X(w)  ,  denoted 
by  "X^  5  x"  ,  if  Fn (oo)  -»■  F(oj)  at  every  point  oj  in  the  set  of  con¬ 
tinuity  points  of  F  . 


3.  A  Graphical  Estimate 


Motivated  by  Equation  (2.2),  Watson  and  Leadbetter  (1964a)  have 

proposed  a  simple  graphical  estimator  h  of  h  ,  where 

8 

h  (x)  =  the  graphical  derivative  of  {-£n(l-F  (x))}  . 
g  n 

The  curve  -£n(l-Fn(x))  is  plotted  against  x  ,  and  a  smooth  curve 

drawn  through  the  points  by  any  reasonable  method.  The  slope  of  this 
curve  at  point  x  is  an  estimate  of  h(x)  . 

Clearly,  this  method  involves  an  unspecified  amount  of  smoothing, 
and  obtaining  the  slope  of  a  hand-drawn  curve  is  notoriously  difficult. 


4.  Estimates  Used  in  Biometrj 


Some  of  the  earliest  discussions  on  estimating  the  failure  rate 
are  in  studies  of  the  mortality  rates  of  animals  and  human  beings.  A 
survey  of  this  early  literature  is  given  by  Kimball  (1960).  Conspicu¬ 
ous  among  these  is  the  paper  by  Grenander  (1956),  who  obtains  the  maxi¬ 
mum  likelihcod  estimator  of  h  under  the  assumption  that  it  is  nonde¬ 
creasing. 


Suppose  that  the  domain  of  x  ,  [0,“)  ,  is  divided  into  k  in¬ 
tervals  [0,a^),  [a^o^),  ...»  •  Let  the  time  interval 


Ij  ’  be 


1^  =  a  -  ,  with  a0  =  0  , 


-  5  - 


.  .  ■»  . '  {  ■ 
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Let  n.  be  the  number  of  X.'s  which  fall  in  I.  ,  i=l,...,k  ,  and  let 
J  i  J 

+  i  be  the  age  at  death  of  the  Ith  individual  in  the  jtft  in¬ 
terval,  £=1, . . . ,n.  ,  with  0  <  t.  .  „  <  I.  ;  here  t.  ,  =  0  . 

1  3-1.*  J  3-1,0 

Kimball  (1960)  proposes  two  different  sampling  procedures  for  es¬ 
timation: 

(1)  the  time  intervals  are  chosen  arbitrarily,  so  that 

each  n^  associated  with  an  interval  is  a  random  variable; 

(2)  the  n^  are  preassigned,  and  the  1^'s  become  random. 

We  shall  first  consider  the  estimation  of  h  under  sampling 
scheme  (1). 

The  actuarial  estimate  h  (x)  of  h(x)  ,  for  x  £  [a.  ,  ,  a.)  is 

a  3-1  3 


ha(x) 


/a._1  +  a.\  n.  ^  . 

\  2  '  =  VV  (^Tn1-...-n.„1-*?ij)  5 


it  is  the  number  of  deaths  per  unit  time  in  the  interval  1^  divided  by 
the  average  number  of  survivals  in  the  interval. 


A  motivation  for  considering  h  is  based  on  the  fact  that  h 

H  3 

is  related  to  the  approximation 


h(x)  = 


dF£xl 

dx  1  AF(x 

l-F(x)  **  avg[l-F(x)]  Ax 


_i _ tDM)  - 

‘-POO,  Ax  -  I[2.p(x  +  I)-F(x-I)]  1 


where  avg(*)  denotes  the  average  of  (•)  . 


Another  estimate  of  h  based  on  preassigned  time  intervals 
has  been  proposed  by  Sacher  (1956),  who  apparently  bases  his  estimate 
on  the  fact  that 


-  6  - 


h(x) 


=  -  ^  £n[l-F(x)]  *  -  ^  £n[l-F(x)] 

-  Jtn[l  -  F(x  +  -|)]  -  *n[l-F(x -■§)])  /  I 

i  .  [1-p(x-i)" 

=  —  £n - r-  . 

1  [l"F(x+|)J 

This  leads  to  the  estimate  h  (x)  for  x  e  [a.  ,a.)  ,  where 

h,  3~1  3 


h  (x) 
bl 


1 


a.  -  a. 
J  J 


-1 


£n 


Whereas  h  is  almost  always  positively  biased,  h  tends  to 
bi  3 

be  negatively  biased  as  x  increases.  The  bias  of  both  estimates 

varies  with  I  ;  in  h  it  increases  monotonically ,  whereas  in  h  i 

3 

sometimes  increases  monotonically,  and  at  other  times  it  starts  off 
being  positive,  and  then  decreases  through  zero  through  negative  values 
Since  h^  is  essentially  a  maximum  likelihood  estimate,  it  has  desir¬ 
able  large  sample  properties.  However,  in  many  real  life  situations, 
expecially  those  involving  human  experiments,  the  sample  sizes  are  not 
large  enough  to  justify  this  argument. 


Under  sampling  scheme  (2),  originally  conceived  by  Moran  (1951), 
an  unbiased  estimate  of  h  has  been  proposed  by  Seal  (1954)  when  1^ 
is  small  enough  so  that 

h(cx.  ,  +  t)  =  h(a.  ,)  ,  0<T<I.  . 

j-1  3-1  =  3 

Seal's  estimate  h  (a.  )  is 

15  2  3  — 1 


n.  -  1 
_J _ 


-  k 


Cj-l,k+l  _  tj- 


l,kJ 


This  estimate  is  efficient  and  complete,  and  has  variance  approximately 
2 

equal  to  h  (a  )  /  (n,-2)  . 
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i 


I 


The  assumption  that  h(<*.  |  +  :)  -  li(u.)  ,  0<  i  <  ]  ,  ,  though 

suitable  tor  some  purposes,  lulls  to  bo  consistent  with  moat  dal  a  in 
biometry  which  suggest  that  h  is  usually  a  nondecreasing  function  of 
h  .  We  refer  the  reader  to  Kimball  (1960)  for  more  details  on  h 

S2 

and  modifications  to  it. 

4.1  The  Life  Table  and  Product  Limit  Estimates 

It  is  now  convenient  to  introduct  here  a  close  relative  of  the 
failure  rate  estimate;  the  "life  table  estimate  of  a  survival  distribu¬ 
tion,"  which  is  dominantly  featured  in  the  literature  on  survival  stu¬ 
dies.  A  key  feature  of  survivorship  and  life  testing  data  is  the  pres¬ 
ence  of  "censored"  or  withdrawn  observations.  Breslow  and  Crowley 
(1974),  in  a  seminal  paper  on  this  topic,  incorporate  a  model  of  random 
censorship  and  study  the  "standard  life  table  estimate"  using  grouped 
data.  The  notation  and  terminology  given  below  is  an  adaptation  of  that 
used  by  Breslow  and  Crowley,  who  in  turn  have  borrowed  it  from  Efron 
(1967). 

Let  X°  ,  n=l,...,N  ,  be  the  true  survival  time  for  the  nth 
n 

individual  included  in  the  life  table.  These  are  assumed  to  be  inde¬ 
pendent  random  variables  having  a  common  but  unknown  distribution  func¬ 
tion  F°  .  The  period  of  observation  or  follow-up,  for  the  nth  indi¬ 
vidual,  will  typically  be  limited  by  an  amount  Y^  ,  n=l,...,N  .  Under 

the  model  of  random  censorship,  Y  ,  ...,Y  are  a^so  assumed  to  be  a 
random  sample  drawn  independently  of  the  xjj  from  an  unknown  distri¬ 
bution  H  .  The  X°  are  said  to  be  censored  on  the  right  by  the  Y  , 
n  n 

since  one  observes  only 

X  =  min(X^  ,  Y  )  and  6=1  , 

n  n  n  n  [x0<y  , 

n=  n 

where  the  indicator  function  6  =  0  ,  if  X  is  censored,  and  6=1, 

n  n  n 

otherwise,  for  n=l,...,N  . 


V  '  . "  ’  *  , 


'*}+ 


-  8  - 
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The  observed  X  ' s  constitute  a  random  sample  from  the  distri- 
n 

bution  function  F  ,  where 

(l-F)  =  (1-H)(1-F°)  . 

The  distribution  function  F  of  an  uncensored  observation  is  called  a 
subdistribution  function.  Specifically, 

F(x)  -  Pjx  <x  ,  6  =lj  =  f  ( l-H(s))  dF°(s)  . 

'  '  0 

In  life  table  analysis,  interest  centers  around  the  distribution  func¬ 
tion  F°  . 


The  time  period  of  observation  [0,T]  is  partitioned  into  k 
intervals  I.  =  (£j  ^  ,  £  ]  ,  with  0  =  <  . . .  <  =  T  .  The  condi¬ 

tional  probabilities  of  failure  in  the  interval 


qi 


F°(C  )  -  F°(C  ) 


are  the  parameters  of  interest, 
failure  rate  for  small  values  of 


since  they  are  approximations  to  the 

If  p  *  1  -  q ^  ,  then  the  prob- 


I. 

1 


ability  of  survival  to  £  , 

J  m 


P 

m 


1  -  F 


(€) 

m 


m 

n 

j=i 


Let  ,  and  represent,  respectively,  the  number  of 

items  surviving  at  the  start  of  ,  the  number  known  to  have  failed  in 
Ij  ,  and  the  number  censored  (or  withdrawn  surviving)  in  1^  .  Further¬ 
more,  suppose  that  it  is  possible  to  subdivide  and  according 

to  whether  an  item  is  due  for  withdrawal  (N^  ,  or  not  ^ue  ^or 

withdrawal  (N^  and  D^)  1°  •  This  subdividion  is  possible  only 

if  one  knows  the  censoring  variables  Y  for  all  N  items,  knowledge 

n 

which  in  most  practical  situations  is  hard  to  come  by. 
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The  standard  life  table  estimate  (SD)  used  most  often  in  practice 
[Berkson  and  Gage  (1950),  Cutler  and  Ederer  (1958),  Gehan  (1969)]  is 
given  by 

D. 


Another  estimate  of  ,  the  reduced  sample  estimate (RS)  studied  by 
Kaplan  and  Meier  (1958)  is 


q  -Si 

qj  N.  . 


It  should  be  noted  that  only  the  RS  estimate  is  generally  con¬ 
sistent  for  q^  ;  however,  because  it  is  based  on  items  which  are  not 

due  for  withdrawal  in  1^  ,  it  is  not  commonly  used.  On  the  other  hand, 

the  SD  estimate  is  consistent  only  under  special  conditions  relating  the 

survival  distribution  and  the  censoring  distribution  H  .  These 

conditions  are  stated  more  precisely  in  the  following  theorem. 


Theorem  4.1  [Breslow  and  Crowley  (1975)]:  Let  the  censoring 
distribution  H  be  absolutely  continuous  with  density  h  on  [0,T]  . 

A  necessary  and  sufficient  condition  that  the  SD  estimate  yield  a  con¬ 
sistent  estimate  of  at  the  end  points  of  each  of  the  k  intervals, 

for  any  choice  of  interval  end  points,  is  that  F^  satisfy 

F°(x)  =  1  -  [ 1/ (1  +  cH(x) ) ] * 

for  some  constant  c  >  0  . 


Quite  often,  a  reasonable  approximation  to  the  censoring  dis¬ 
tribution  is  the  uniform  distribution;  thus  H(x)  =  x/T  ,  0<x<T  ,  in 
which  case  the  distributions  yielding  a  consistent  estimate  satisfy 

F°(x)  =  1  -  [l/(l+cx)] 2  .  However,  the  distribution  1  -  [l/(l+cx)]i 
has  a  failure  rate  function  which  decreases  in  time,  implying  negative 
aging,  and  this  may  be  unrealistic  in  many  situations. 
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Theorem  4.1  therefore  suggests  that  the  SD  estimate  is  generally 


inconsistent,  with  converging  to  q*  ,  where  q*  ^  q^  is  specified 


by  Breslow  and  Crowley.  The  asymptotic  normality  of  the  SD  estimate  has 
also  been  established  by  Breslow  and  Crowley,  who  also  point  out  that 


the  individual  q^'s  are  asymptotically  uncorrelated. 


An  estimate  £  of  P  ,  the  probability  of  survival  to  £  , 

mm  m 


is  given  by 


m 


P  =  n  (l-q  )  . 

m  j-1  J 


Since  the  estimates  q  .  depend  on  the  partition  0  <  £  <  E,  <  . . .  <  F  <  T 

J  J.  Z  K 


and  sample  size  N  ,  it  is  helpful  to  denote  the  dependence  of  P  on 

m 


these  choices  by  writing 
iO 


P  =  1 

m 


Pk,N(t)  for  t  C  »  ^m-1^  ’  m=1»2» • • • >k~l 


Kaplan  and  Meier  (1958)  introduce  the  now  extensively  discussed  product 

,  and  where  the 


Limit  estimate  (PL)  fP  of  ,  where  =  lim  F^  „ 

N  k-*x>  k’N 


right  continuous  limit  is  taken  under  any  nested  sequence  of  partitions 


such  that  sup  C  -  £,  ,  I  ->■  0  .  Kaplan  and  Meier  show  that  for 

Km<k  m  ra~1 


X(l)  -  X(2) 


<  ,  the  PL  estimate  can  be  written  as 

(N) 


i  -  ?°hm  -  _!  (i  -  ifij) 


(j) 


J-1 


where 


5* 

(j) 


1  ,  if  <  c  »  an<*  X(j)  is  uncensore£l» 


0  ,  otherwise. 


Strong  consistency  of  the  PL  estimate  is  shown  by  Peterson  (1977) 


50 


In  order  to  obtain  the  asymptotic  distribution  of  F^  ,  Breslow  and 


Crowley  define  the  stochastic  process  (z*(t)  ;  0<t<T}  ,  where  Z*(t) 

N  N 


rfi  (F°(t)  -  F°(t))  .  Then 
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TJioori'm  4 . 2  | Bros  low  aiul  Crowh'v  l.t>i  t  -  ...  s.itlslv 

F(T)  <  1  ,  and  suppose  that  F^  and  H  are  continuous.  Then  the  pro¬ 
cess  {Z*(t)  ;  0<t<T}  converges  weakly  to  a  mean  0  Gaussian  process 

(Z*(t)  ;  0<t<T}  whose  covariance  structure  for  s<.t  is  given  by 

g 

Cov(Z*(s)  ,  Z*(t))  =  (1-  F°(a))  (1-  F°(t))  /  ( 1  -  F(x) ) ~2  dF(x)  . 

0 

Hall  and  Wellner  (1980)  transform  the  above  weak  convergence  re¬ 
sult  to  a  Brownian  bridge  form,  and  then  obtain  uniform  confidence  bands 
for  F°  . 

An  alternate  method  of  estimating  F^  under  different  censoring 
schemes  is  given  by  Turnbull  (1976).  Wegraan,  Nour,  and  Kukuk  (1980) 
give  a  time  series  approach  to  life  table  construction. 


5 .  Kernel  Estimates 

Of  the  various  methods  used  for  estimating  h  ,  those  based  on 
smoothing  functions,  called  kernels ,  happen  to  be  predominant  in  the 
literature.  Such  estimators  are  known  as  "kernel  estimators  of  the 
failure  rate,"  and  it  appears  that  their  consideration  is  motivated  by 
the  popularity  and  success  of  kernel  estimators  of  the  (underlying) 
density  function. 

The  choice  of  the  kernel  is  very  important,  and  to  a  large  ex¬ 
tent  determines  the  properties  of  the  estimators.  Watson  and  Leadbetter 
(1964b),  in  an  insightful  and  fundamental  paper  on  this  topic,  intro¬ 
duced  a  sequence  of  functions  {6  (x) }  ,  which  they  called  a  6- function 

n 

sequence,  which  satisfies  the  following  conditions: 

(a)  6^  e  L1  (i.e.,  /|6n(x)|dx  <  °°)  ,  for  all  n  ; 

(b)  /5  (x)dx  =  1  ,  for  all  n  ; 

n 


I 
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(c)  6^(x)  0  >  uniformly  in  |  x  |  ^  X  ,  for  some  fixed  X>0  , 

(d)  /|x|>^  6^(x)dx  -►  0  as  n->-®  for  any  fixed  X>0  . 

In  summary,  { 6^ (x) }  is  a  sequence  of  smoothing  functions  (kernels) 

tending,  as  n  +  “>  ,  to  a  "Dirac-6  function."  Using  the  sequence 
{^n (X) ^  *  they  consider  the  following  two  estimators  of  h  : 


and 


where 


hn(x)  = 


f  (x) 


1  -  F  (x) 


n  6  (x-  X,..) 

h  (5>  -  i  J — — fii- 

n  .  n-j+1 

j  =  l 


?n(x)  6n(x“X(j)) 


and  F  (x)  =  /  f  (u)du 
n  0  n 


(5.1) 


(5.2) 


If  the  sequence  (6  (x) }  is  suitably  chosen,  that  is,  if  a  = 
n  n 

2 

/6q(x) dx  <  00  at  every  point  of  continuity  x  of  h  at  which 

/v 

F (x)  <  1  ,  then  h^(x)  is  shown  to  be  asymptotically  unbiased  with  an 
asymptotic  variance  given  by 

<5-3> 


If  in  addition  to  the  above  =  0(n)  ,  then  the  asymptotic  variance 

/\  A 

Var[h  (x)]  converges  to  zero  in  the  order  of  a  /n  ;  that  is,  h  is 
n  n  n 

consistent. 


Under  the  following  slightly  more  restrictive  conditions. 


(e)  =  /6^(x) dx  <  00  ,  /|6n(x)|  ^dx  <  00  ,  for  some  u  >  0  and 


a 


such  that 


nn/2 .  ai+n/2 

n 


2+nd 
0  ,  as  n  -*■ 1 
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(t)  ^ ^  U:|i  i(s>l  i  (s') !  *  n  .  m'l.'j  n  *  •"  ;  .ni,i 

(g)  (E[Fn(x)]  -  F(x) }  -  0  ,  as  n  +  ~  ; 

Watson  and  Leadbetter  (1964b)  show  that  the  random  variable 

ll_F(x)3[^TOo]2[fin(x)  “  h(x)1  (5-4) 

is  asymptotically  normally  distributed  with  mean  zero  and  variance  1,  at 
every  continuity  point  x  of  h(x)  . 

A 

In  contrast  to  the  asymptotic  results  for  the  estimator  h  , 

n 

small  sample  properties  of  h  have  been  obtained  by  Watson  and  Lead- 

n 

better  (1964b) ,  who  show  that 

E[h  (x)]  *  /6  (x-u)h(u)du  -  fS  (x-u)Fn(u)h(u)du  ,  (5.5) 

n  n  n 

and 

Var [h  (x)]  =  /62 (x-u)h(u)I  (F(u))du  + 
n  n  n 


+  2  /  f  6n(*-u)6n(x-v) 
0<u<v 

where 


l-F(v) 


fl-Fn 

_1-F( 


-F“(u) 

F(u) 


Fn(v)  - 


Fn(v)-Fn(u) 

F(v)-F(u) 


dF (u)dF (v) 


In(F)  =  r 
n  0  B 


dB 


To  study  the  large  sample  properties  of  h^  ,  a  further  condition 

is  required  which  restricts  the  class  of  distributions  considered.  Spe¬ 
cifically,  for  a  given  6-function  sequence,  a  class  Cg  of  distribution 

functions  F  is  defined  such  that  for  any  fixed  Xq  and  any  fixed 


X  >  0 


there  exists  a  G..  such  that 


l-F(x) 


<  G 


X 


for  all  sufficiently  large  n  ,  and  jx-Xg|  i  ^  * 
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Under  the  assumption  that  F e  ,  and  h  is  continuous  at  , 
with  F(xq) <  1  ,  one  can  show  that  an  asymptotically  un¬ 

biased  estimator  of  h(Xg)  .  If,  in  addition,  the  5-function  sequence 

2 

is  chosen  such  that  a  =  /5  (x)dx  <  00  ,  and  a  =  0(n)  ,  then 

n  n  n 

IS  (t)  Var(W’  ■  • 

The  assumption  =  0(n)  implies  in  particular  that 

Var[h  (x„)]  -*■  0  ;  thus  h  is  consistent,  and  in  fact  its  variance  cou¬ 
rt  U  n 

verges  to  0  in  exactly  the  same  way  as  a  / n  (i.e.,  Var[h  (x„) ]  = 

n  n  0 

0(a^/n)).  A  comparison  of  this  result,  with  that  given  by  (5.3),  sug¬ 
gests  that 

Var[hn(xQ)]  -  Var[hn(xQ)]  . 


In  Watson  and  Leadbetter  (1964a) ,  some  numerocal  investigations 
were  conducted  with  the  main  conclusion  that  in  practice,  the  estimators 
h  and  h  are  essentially  equivalent. 


Murthy  (1965)  proposes  two  estimators  Z  and  Z*  of  h  by 

n  n 

considering  5  (x)  =  B  K(B  x)  ,  where  B  is  a  nonincreasing  function 
n  n  n  n 

of  n  such  that  lim  B  =  <®  and  lim  B  /n  =  0  ,  and  the  kernel  K 

_  n  n 

n-*®  n-*® 

satisfies  the  conditions 


If 


K(x)  >.  0  ,  K(x)  =  K(-x)  ,  lira  xK(x)  =  0  ,  and  /K(x)dx  =  1  . 

|x|-*» 


B  n 


f„<*>  '  Bn/K(Bn(u-x))dFn(u)  -  f  £  K(Bn(Xrx)) 


and 


R*(x)  =  /  f  (u)du  , 
n  x  n 


then 
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Z  (x)  =  f  (x)/R  (x)  and  Z*(x)  =  f  (x)/R*(x)  . 
n  n  n  n  n  n 

If  Xq  Is  a  continuity  point  of  both  F  and  f  ,  then  the 

estimators  Z  (x.)  and  Z*(x_.)  are  consistent  and  asymptotically 
n  u  n  U 

normally  distributed. 

Recent  literature  on  the  topic  of  this  survey  has  received 
its  impetus  from  the  work  of  Rice  and  Rosenblatt  (1976),  who  propose 

three  estimators,  h^  ,  h^  ,  and  h^  of  h  .  These  estimators 

n  n  n 

are  similar  to  those  of  Watson  and  Leadbetter,  and  Murthy,  except  for 
the  fact  that  the  modified  sample  distribution  function  (the  usual 
sample  distribution  function  multiplied  by  n/(n+l))  is  considered. 

They  consider  in  good  detail  both  the  bias  and  the  covariance  proper¬ 
ties  of  ■.  rpressions  which  agree  with  these  estimators  up  to  a  term  which 
is  negligible  in  order  of  magnitude  with  large  probability. 


Rict  and  Rosenblatt  let  H(x)  =  -£n(l-F(x))  ,  H  (x)  = 

n 

-£n(]~F  (x>)  ,  and  denote  their  6-function  sequence  by  {to  (x) }  , 

n 

where  ^(x)  =  >  with  0)  a  kernel  which  is  bounded,  band- 

n  n  '  n' 

limited  (i.e.,  oj(x)  =  0  ,  |x|  >_  A  for  some  positive  constant  A  ), 

symmetric,  and  of  integral  one,  and  b  +0  with  nb  ■+•  00  ,  as  n-*00  . 

n  n 

An  estimate  of  the  underlying  density  function  f  is  given  by 

f  (x)  =  /a)  (x-u)dF  (u)  . 
n  n  n 

Their  estimates  of  the  failure  rate  function  h(x)  are 


£n<*> 
!-Fn(x)  * 


dF  (u) 


(•y\  n v  '  1 

’  (x)  =  /u)  (x-u)  n  ■=—.  .  *  \  (a)  (x-X...)  - 7TT 

n  n  1-Fn(u)  n  (j)  n-j+1 


and 
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hn3)(x)  =  ^n(x-u)dHn(u)  =  a,n(x-X(.))£n[l+;r^iT]  . 


For  the  estimate  h^^(x)  ,  Rice  and  Rosenblatt  show  that  if 
n 


FeC  (i.e.,  F  is  continuously  differentiable),  then 

h< 

where 


'  V*>  (>  +  <>,(£)) 


f  (x) 

an(x)  "  FFOO  • 


Furthermore,  if  FeC  ,  then 


E(anOO)  =  h(x)  +  -y  /  u2oi(u)du  +  o(b2)  ; 

also,  a  (x)  is  asymptotically  normally  distributed, 
n 


Thus,  the  leading  bias  term  of  a^(x)  ,  where  an(x)  agrees 
with  h^\x)  up  to  a  term  which  is  negligible  in  order  to  magnitude 
with  large  probability,  is  proportional  to  f"(x)/(l-F(x))  . 

(2)  2 

For  the  estimate  h  (x)  ,  if  FeC  ,  and  if  x  is  such  that 
n 

F(x)  >  0  ,  then 

b2 

E[h^2^(x)1  =  h(x)  +  ~  h"(x)  /  u2co(u)du  +  o(b2)  , 

L  n  J  l  n 

(2) 

implying  that  the  leading  bias  term  of  h^  (x)  is  proportional  to 
h" (x)  . 

(3) 

In  order  to  study  the  estimator  h^  (x)  ,  Rice  and  Rosenblatt 

(2)  (3) 

consider  the  difference  between  h  (x)  and  h  (x)  ,  and  show  that 

n  n 

it  is  negligible  in  order  of  magnitude  with  large  probability.  That  is. 
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h<2>w  -  „»><„ 


Thus,  like  h^^(x)  ,  the  leading  bias  term  of  h^^(x)  is  also  pro¬ 
portional  to  h"(x)  . 


Using  the  fact  that  the  leading  bias  term  of  h^(x)  is  pro- 

n 

portional  to  f"(x)/(l-F(x))  ,  it  can  be  shown  that  the  bias  of  h^(x) 

n 

(3) 

and  h^  ' (x)  is  larger  if  f'(x)  =  0  and  f"(x)  >  0  ,  or  if  f  is 
almost  constant  near  x  ;  whereas  the  bias  of  h^^(x)  is  larger  if 
h' (x)  =  0  and  h"(x)  >  0  ,  or  if  h  is  almost  constant  near  x  . 


Rice  and  Rosenblatt  also  consider  the  interesting  case  of  the 
observations  X^,...,Xn  being  dependent.  Specifically,  if  the  observa¬ 
tions  constitute  a  stationary  time  series  which  satisfies  some  mixing 

(regularity)  conditions,  then  the  estimates  h^  ,  h^  and  h^ 

'  n  n  n 

are  asymptotically  equivalent  to  a  (x)  =  f  (x)/(l-F(x))  . 

n  n 


The  main  contribution  of  Rice  and  Rosenblatt  (1976)  is  to  obtain 
a  global  result  on  the  estimation  of  the  failure  rate,  specifically  on 
the  maximal  weighted  deviation  between  an  estimate  of  h  ,  and  h  .  They 
have  accomplished  this  by  directly  applying  the  results  of  Bickel  and 
Rosenblatt  (1973)  on  the  estimation  of  the  density  function  f 
[strengthened  by  Rosenblatt  (1976)  using  the  recent  results  of  Komlos, 
Major,  and  Tusnady  (1975)].  Specifically,  they  obtain  the  asymptotic 

distribution  of  the  global  weighted  deviation  of  h^(x)  from  h(x)  , 

n 

on  any  finite  interval  whose  length  grows  with  the  sample  size  n  and 
which  diverges  (tends  to  infinity)  as  n  tends  to  infinity. 

Consider  a  sequence  (£(n)}  ,  where  £(n)  denotes  the  length  of 
a  finite  interval,  and  where  £(n)  -*  °°  as  n  -+•  00  ,  but  in  such  a  way 

“6 

that  log  £(n)  =  0(n)  ,  and  let  b  =  n  ,  for  0<6<%  .  Let 


-  18  - 


T-422 


M  =  max 
n  |x|<t(n> 


(nbnf-1(x))  (1- F(x) ) (h^X) (x)-h(x)) 


.(1), 


denote  the  maximal  weighted  deviation  of  (x)  from  h(x)  ,  weighted 

-1  % 

by  (nb  f  (x))  (l-F(x))  ,  over  the  interval  £(n)  .  Under  some  regu- 
n  ' 

larity  conditions  on  £(n)  ,  and  the  additional  assumption  that 


sup  (l-F(x))"1  =  o(n^)  , 
lx|<£(n) 


one  has  the  result  that 


lim  P 

n-K» 


(2  log  c(n))  2 


M 


,(Mto)) 


b  "  V  5-  x 


=  e 


-2e 


-x 


where 


and 


c(n)  =  2i(n)/b  ,  A(to)  =  /to  (u)du  , 

n 


(2  log  c(n))  2  + 


K,  (to) 

log  -  +  —  log  log  c (n) 

/n 


(  2  log  c(n>)  2 

“  V“>  ,  >  0  ; 


d  -  -f 
n 


(2  log  c  (n) )  2 


(2  log  c(n)) 


r.  1V11 

k  Ll0g  TT  2  J 


if  ,  with  K^Cto)  = 


I  J~[to^(u)  l^du 


X(to) 


The  above  global  result  can  be  used  to  construct  uniform  confi¬ 
dence  bands  for  the  failure  rate  function  h(x)  using  the  estimator 

h^(x)  .  A  100 £%  uniform  confidence  band  for  h(x)  ,  for  |x|  <  £(n) 
n 

is  given  by 
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h11'^ 

ll 


('("))’  I  V 

'  (.x  ))  (I  1(0)  |_l  !«•>;  i'  (it)  ) 


with  y  =  -log(-«  pp)  . 


Rice  and  Rosenblatt  undertake  some  simulations  to  give  some  indi¬ 
cation  of  the  performance  of  the  estimator  h^  and  its  limiting  dis- 

n 

tribution,  and  claim  the  results  to  be  encouraging  for  large  sample 
sizes  with  narrow  bandwidths. 

(2)  (3) 

Global  results  for  the  estimators  h  and  h  and  the  re- 

n  n 

suiting  uniform  confidence  bands  have  been  obtained  by  Sethuraman  and 
Singpurwalla  (1981)  via  considerations  of  a  "naive  estimator"  of  h  . 
This  estimator,  which  is  shown  to  be  uniformly  asymptotically  equivalent 

(?)  (3) 

to  h  and  h  ,  will  be  discussed  later,  in  Section  6. 
n  n 


5.1  Improvement  of  Kernel  Estimates — Use  of 
the  Generalized  Jackknife 


Singpurwalla  and  Wong  (1980a)  attempt  to  reduce  the  bias  and 
improve  upon  the  rate  of  convergence  of  the  mean  square  error  (MSE)  of 
kernel  estimators  by  considering  estimators  of  the  type 


1(X0)  b  £  n-j+1 

n  3=1 


^(i)~V 


where  the  sequence  {b  }  is  such  that  b  10  and  nb  -*■  00  ,  as 

n  n  n 

n  -*■  00  ,  and  K  is  a  bounded  and  symmetric  kernel  of  integral  one  with 

the  additional  properties  that  /jK(x)  |dx  <  00  (i.e.,  KeL^)  ,  and 

lim  IxK(x) I  =  0  .  Note  that  h  (x„)  is  identical  to  (5.2),  the  esti- 
_ _  n  U 

X-K” 

mator  h  of  Watson  and  Leadbetter  (1964a,  1964b),  and  the  estimator 

h^  of  Rice  and  Rosenblatt  (1976). 

n 
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The  authors  first  prove  a  "pointwise  saturation  theorem,"  which 

says  that  for  the  class  of  nonnegative  L1  kernels  K  ,  and  with  the 

2  12  2 

additional  requirements  that  x  KeL  ,  lim  nb  (n)  =  00  ,  heC  ,  and 

n-K» 

(3) 

h  (the  third  derivative  of  h  )  being  uniformly  bounded,  the  rate  of 

-4/5 

convergence  of  the  MSE  of  h^(xg)  is  at  most  0(n  )  ,  regardless  of 

the  smoothness  of  h  .  If  the  condition  that  the  kernel  K  be  nonnega¬ 
tive  is  relaxed,  that  is,  if  the  kernel  is  allowed  to  take  some  negative 
values,  then  the  bias  contribution  to  the  asymptotic  MSE  can,  in  princi¬ 
ple,  be  eliminated  to  any  desired  order,  and  the  rate  of  convergence  of 

the  asymptotic  MSE  given  by  0(n  2m/(2m+l)^  for  some  m>3  ,  can  be 

brought  as  close  to  n  ^  as  is  desired.  It  is  shown  that  an  indefinite 
use  of  the  "generalized  jackknife  method"  of  Gray  and  Schucany  (1972) 
can  be  used  to  achieve  this  goal.  Furthermore,  by  eliminating  the  re¬ 
quirement  that  K  £  ,  one  is  able  to  consider  kernels  for  which  the 

rate  of  conveggence  of  the  MSE  of  h  (x.)  is  not  as  slow  as 

n  (J 

0(n  2m/(2m+l)^  ^  One  possible  non-L^  kernel  is  the  "sine  function 
kernel,"  K(x)  =  X  ,  considered  by  Singpurwalla  and  Wong  (1980b). 

If  the  Fourier  transform  of  h  decreases  exponentially,  then  the 

asymptotic  MSE  of  h  (x„)  using  the  sine  function  kernel  decreases 

n  U 

at  the  rate  logn/n  ,  which  is  very  close  to  the  ideal  rate  of  n  ^  , 
which  cannot  be  achieved  in  practice. 


6.  The  Naive  Estimator 


The  "naive  estimator*"  of  h  was  originally  proposed  by 
Grenander  (1956b),  and  has  been  considered  by  Marshall  and  Proschan 
(1965)  and  by  Barlow,  Bartholomew,  Bremner,  and  Brunk  (1972,  Section 


*Naming  the  estimator  naive  is  due  to  Frank  Proschan. 
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5.3)  for  estimating  a  monotonical ly  increasing  (or  decreasing)  function 
li  .  It  lias  been  used  by  Singpurwalla  (1975)  for  performing  a  time 
series  analysis  of  failure  data. 

Following  the  notation  of  Section  4.1,  let  ,  n=l,...,N  ,  be 

n 

the  true  survival  time  of  the  nth  item,  assumed  to  be  independent  iden¬ 
tically  distributed  random  variables  having  a  common  but  unknown  distri¬ 
bution  .  Let  Y  ,  n=l,...,N  ,  be  the  withdrawal  time  of  the  nth 

n 

item,  which  may  or  may  not  have  a  joint  distribution  H  ,  but  which  must 
be  independent  of  the  X^'s  .  As  before,  let 

X  =  min(X°  ,  Y  )  ,  n=l, . . . ,N  , 
n  n  n 

and  interest  centers  arount  the  distribution  function  . 


Suppose  that  k  failures,  (k<N)  have  been  observed  in  all, 

and  let 

0  "=  z(0)^z(i)i-iz(k) 

be  the  ordered,  failure  times.  Note  that  X^  is  a  failure  time  whenever 

X  =  X°  .  The  total  time  on  test  at  time  t  ,  T„(t)  ,  is  defined  as 
n  n  Nx 

t 

T  (t)  =  /  N(h»  Uiw  , 

0 

where  N(u)  is  the  number  of  items  which  are  surviving  at  time  u  . 

The  naive  estimator  of  h(z)  ,  h*(z)  ,  is  defined  as 

N 


h*(z) 


Vz<i)>  -  VVi)>  '  Za-1>— ZW>  •  1-1 . k 


Z-Z(k)  • 


Note  that  h*(z)  is  the  reciprocal  of  the  total  time  on  test  in  the 
interval  (Z(i  l)  ,  Z(i) ]  . 
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When  k  =  N  ,  that  is,  when  there  is  no  censoring,  =  X^^  , 

V  i  ,  where  X?, .  <  <  —  <  X,,,.  are  the  order  statistics  of 

(i;  •  (l)  -  —  (N) 

X?, — ,xP  .  In  this  case,  for  X,.  1X<  z  <  X*?.,.  ,  the  naive  estimator 

1  N  (i-1)  (i) 

becomes 


h*(z)  =  {Total  time  on  test  in  (X^  ^  ,  X^^  ]  }  ^ 
-  [(n-i+l)(X®i) -X°±  1))]"1  . 


Sethuraman  and  Singpurwalla  (1980)  show  that  the  above  estimator 
is  asymptotically  unbiased,  but  is  not  consistent,  since  it  has  a  limit¬ 
ing  nondegenerate  distribution.  Furthermore,  since  for  any  m  distinct 

time  points  z,,...,z  the  estimators  (h*(z, ) , . . . ,h*(z  )}  are  asymp- 
im  NINm 

totically  independent,  the  graph  of  (h*(z)  ,  z^O}  will  exhibit  wild 

fluctuations,  prohibiting  the  use  of  h*  as  an  estimator  of  h  .  This 

N 

behavior  of  h*  is  analogous  to  the  behavior  of  the  sample  periodogram 

which  is  used  to  estimate  the  spectrum  of  a  stationary  time  series. 

Thus,  following  the  standard  technique  of  "smoothing"  the  sample  period¬ 
ogram,  Sethuraman  and  Singpurwalla  smooth  the  naive  estimator  h*  using 

a  bandlimited  k». •«  rel,  to  obtain  a  consistent  and  an  asymptotically  nor¬ 
mal  estimate  of  the  failure  rate. 

The  smoothed  estimator  h*(z)  obtained  by  smoothing  the  naive 
estimator  h*(z)  using  a  kernel  of  bandwidth  2b^A  is,  for  z^bj^A  , 
given  by 

h*(z)  =  ^  /  u^h*(s)ds  , 

where  the  kernel  w  and  the  sequence  {b^}  have  the  properties  re¬ 
quired  by  Rice  and  Rosenblatt  (1976)  (see  Section  5). 
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The  estimate  h*(z)  is  consistent,  and  if  the  first  derivative 
N 

u)'  of  to  is  such  that  to'  e  L'*'  ,  then  h*(z)  is  uniformly  close  to 

N 

h  (z)  of  Rice  and  Rosenblatt  on  bounded  intervals  of  length  K  , 
n 

where  K  is  such  that  K+A  <  .  Furthermore,  under  some  regularity 

--  (N; 

conditions  on  F  ,  h*(z)  can  be  approximated  by  an  appropriate  Gaussi- 

2 

an  process.  Under  the  additional  requirements  that  heC  , 

2  1  5 

(to'(z))  e  L  ,  and  that  Nb„l°gb  4-0  as  n-*-00  ,  the  asymptotic  dis- 

N  n 

tribution  of  the  global  deviation  of  h*(z)  from  h(z)  can  be  obtained 

on  any  finite  interval  tbNA  ,  K]  ,  in  much  the  same  way  as  in  Rice  and 

Rosenblatt  [see  Theorems  2.2  and  2.4  of  Sethuraman  and  Singpurwalla 
(1981)].  These  results  can  be  used  to  construct  the  100e%  confi¬ 
dence  bands  for  h(z)  ,  for  b  A£z<_K  5  these  are 


h*(z)  ± 


hg(z)  Vs 

‘NVn(z)^ 


ft  •  ■■) 


(6.1) 


where  Y  =  -log(- (?i)  loge)  ,  F^  =  1-FN  ,  with  F^  being  the  empirical 
distribution  function  of  £  X(2)  =  •'*  =  X(N)  ’  ant*  for  CN  = 

(21og(K/bN))'5  , 


(A(o>)) 


CN  +  log  [C. 


1  CN  ’ 


if  ^((jo)  >  0  , 


[4  +  log  (^)]  /  cn  . 


if  K-^to)  <  0  ; 


A(u>)  ,  K1(ai)  ,  and  x2((a)  have  been  defined  in  Section  5. 
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(3) 

Since  h*(z)  is  uniformly  close  to  h  (z)  ,  and  since 
N  n 

h^\z)  -  h ^\z)  =  0  ( l/a )  ,  the  I00c%  uniform  confidence  bands  on 
n  n  p 

h(z)  can  also  be  obtained  by  replacing  h*(z)  in  Equation  (6.1)  by 
the  Rice  and  Rosenblatt  estimators  h^(z)  or  h*^(z)  . 


6.1  Random  Smoothing  of  the  Naive  Estimator 

It  is  to  be  emphasized  that  the  results  of  Watson  and  Leadbetter 
(1964a,  1964b),  Murthy  (1965),  Rice  and  Rosenblatt  (1976),  and 
Sethuraman  and  Singpurwalla  (1980)  apply  only  when  there  is  no  censor¬ 
ing,  when  the  sample  size  becomes  large,  and  under  a  host  of  regularity 
conditions  on  h  .  The  above  conditions  are  not  always  easy  to  satisfy 
in  practice,  and  so  in  an  attempt  towards  eliminating  them.  Miller  and 
Singpurwalla  (1980)  consider  a  randomly  smoothed  estimator  hN  of 

h  ,  defined,  for  some  j  and  i=j , j+1 , . . . ,k  ,  as 

‘VW  -  Wn”'1  •  Vi><ziz(i) 

o  ,  z>z(k)  ; 

note  that  when  j=l  ,  h*  =  h*  ,  the  naive  estimator. 

W  ,  J  N 


hS,jw  ■ 


Alternate  expressions  for  h*  ^  in  terms  of  h*  ,  for 
i=j , j+1, . . • > k  .  are 

j-1 


-  7  l 


j  m-0  hS(Z(l-m)>  ’ 


'(i-l)<Z=Z(i)  ’ 


■Z(i) 


h*(u)dTN(u) 


hN,j(z) 


:a-ii 


/ 


Z(i) 


*  Z(i-D<Z^Z(i)  ’ 


dTN(u) 


'(i-j) 


F  j.' 
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Since  the  naive  estimator  h*  is  constant  over  a  failure  inter- 

N 

val,  it  can  be  viewed  as  some  kind  of  smoothing  over  a  single  failure 

interval.  If  this  notion  of  smoothing  is  extended  to  cover  j  failure 

intervals,  then  h*  .  results;  this  is  a  motivation  for  h*  .  . 

N,j 

A  reason  for  considering  h*  .  is  that  it  leads  to  an  exact 

N.J 

distribution-free  confidence  region  in  the  case  of  finite  samples,  with 
censoring.  Also,  its  asymptotic  theory  does  not  involve  the  usual  regu¬ 
larity  conditions.  Furthermore,  h*  .(z)  is  a  consistent  estimator  of 

N,J 

f(z)  at  every  continuity  point  z  of  h  .  A  disadvantage  of  h*  .  is 

N,J 

that  it  yields  confidence  limits  on  a  smoothed  version  of  h  ,  h*  , 

N,j 

where 


h(u)dTN(u) 


h*  (z)  ~  x  (Z  )  -  t  (Z  )  ’  <  z  =  Z(i)  * 

n, j  V  (i)'  VZ(i~j)'  {  }  U> 

rather  than  h  itself.  This  disadvantage  is  of  concern  only  if  h 
changes  too  rapidly. 

Specifically,  a  100e%  upper  confidence  bound  for  h*  (z)  , 
Cot  zd-i)<ziz<D  -  is  N’:' 


1+  i 


where  C.  ,  „  is  the  critical  value  such  that 

J.k.e 


P  sup  l  (V -1)  <  cT 


j<i<k  j-i-j+1 


j  »k, £ 


=  1-e  , 


and  where  the  are  independent  and  identically  distributed  random 


variables  with  density  e 
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The  lower  and  the  two-sided  confidence  hounds  for  li*  ran  he 

N , 

similarly  obtained,  in  terms  of  C  and  C.  ,  ,  respectively, 

3,k.,c 

where  C  ,  and  C.  ,  are  the  inflmum  and  the  supremum  of  the 

i 

absolute  value  of  £  (U  -1)  ,  respectively. 
m=i-j+l  m 

At  present  there  do  not  exist  any  analytical  methods  for  comput¬ 
ing  C+  ,  ,  C  ,  and  C.  ,  ;  however,  these  can  be  easily  ob- 

J  }  K. ,  G  J,K.,G 

tained  via  a  Monte  Carlo  method,  for  any  desired  values  of  j  ,  k  ,  and 

£  .  The  need  for  such  a  Monte  Carlo  for  every  required  combination  of 

j  ,  k  ,  and  c  could  be  viewed  as  another  disadvantage  of  using  h*  .  . 

N » J 

Miller  and  Singpurwalla  (1980)  also  develop  the  asymptotic  theory 
for  h*  j  ,  as  N  +  °°  with  j/N+il  ,  0<  £<  1  ,  and  present  the  confi¬ 

dence  bounds  for  h*  based  on  this  theory.  However,  in  the  light  of 

N,j 

the  results  of  Watson  and  Leadbetter  and  Sethuraman  and  Singpurwalla, 
there  are  no  advantages  in  using  these  bounds,  and  so  they  are  not 
discussed  here. 

7 .  The  Generalized  Failure  Rate  Function 
and  Its  Estimation 


Barlow  and  Van  Zwet  (1970)  generalize  the  definition  of  the  fail¬ 
ure  rate  function  by  considering  a  known  absolutely  continuous  distribu¬ 
tion  function  G  with  density  function  g  ,  and  define  the  generalized 
failure  rate  function  of  F  ,  for  all  x>0  such  that  g[G  1F(x)]  >  0  , 
by 


r(x)  = 


f(x)  . 
8 [G-1F (x) ] 


Note  that  when  G  is  taken  to  be  the  exponential  distribution  with 
parameter  r=l  ,  then  r(x)  *  h(x)  ,  the  failure  rate  function  given  by 
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(2.1),  whereas  if  G  is  taken  to  be  the  uniform  distribution  on  [0,1]  , 
then  r(x)  =  f(x)  ,  the  density  function  of  F  .  A  motivation  for  con¬ 
sidering  the  generalized  failure  rate  function  is  not  within  the  scope 
of  this  survey,  and  is  given  by  Barlow,  Bartholomew,  Bremner,  and  Brunk 
(1972).  For  now  it  suffices  to  know  that  h  is  a  special  case  of  r  . 

Ahmad  (1976)  considers  a  kernel  estimate  of  f(x)  , 

{(x)  .  J_  1  (u)  -  P  j  K&ii)  , 

n  \  n  /  n  j=l  \  n  / 

where  the  kernel  K  and  the  sequence  {b  }  satisfy  some  regularity 

n 

conditions.  He  introduces  the  following  kernel  estimate  of  r(x), 

r(x) - ^ -  , 

g [G  V(x))] 

X 

A  A  A 

where  F(x)  =  f  f(u)du  ,  and  shows  that  r(x)  is  consistent. 

—00 

Shaked  (1978)  proposes  the  following  estimator  of  r(x)  ,  for 
any  fixed  x  : 


F  (x  +  4  C  -F  x-Jc 
_  x  x  n\  2  n/  n\  2  n 

p  (x)  =  - — 7-f-: - - = - : - *r-  , 

where  C  is  a  positive  constant,  R(x)  =  li  :  x-4  C  <  X...  <  x  +  -^C  >  , 

n  (  2  n  =■  (l)  =  2  n ) 

and  M(x)  *  number  of  X...  :  X,..  >  x  +  t~  C 

(i)  (i)  ■==  2  n 

To  gain  some  insight  into  p^(x)  »  note  that  if  G  is  the  expo¬ 
nential  distribution,  then  the  estimator  is  simply  the  ratio  of  the 
(normalized)  number  of  failures  in  the  interval  [x-y  ,  x  +  y  C^]  to 
the  total  time  on  test  in  the  interval. 
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When  C  =  cn  ,  where  c  >  0  and  0  <  a  <  1  ,  and  if  r  is 
n 

smooth  in  the  neighborhood  of  x  ,  Shaked  (1978)  shows  that  [Wx)  is 
asymptotically  normal.  In  particular,  for  l/5<a<l  , 

n(l-a)/2  ^p^(x) _  r(x) 

has  an  asymptotic  standard  normal  distribution,  whereas  when  1/7  < 
a  <  1/5  , 


/cf (x) 
r  (x) 


Vcf (x) 
r(x) 


(l-a)/2 

n 


Pn(x) 


r(x)  - 


C^rOO 

24f (x) 


f"(x)  - 


(x)r2(x)R1(G~1F(x))\ 
f(x)  / 


has  an  asymptotic  standard  normal  distribution.  Note  that  the  above 

expression  is  an  extension  of  (5.4)  applied  to  p  (x)  . 

n 


Barlow  and  Van  Zwet  (1969,  1970)  and  Shaked  (1978)  also  consider 
methods  of  estimating  r(x)  when  some  monotonicity  conditions  are  im¬ 
posed  on  it.  However,  these  are  not  given  here,  since  they  do  not  be¬ 
long  to  the  purview  of  this  survey. 
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